Picture for Kevin Qinghong Lin

Kevin Qinghong Lin

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

Add code
Jan 07, 2026
Viaarxiv icon

ShowUI-$π$: Flow-based Generative Models as GUI Dexterous Hands

Add code
Dec 31, 2025
Viaarxiv icon

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Add code
Dec 18, 2025
Viaarxiv icon

Computer-Use Agents as Judges for Generative User Interface

Add code
Nov 19, 2025
Viaarxiv icon

Grounding Computer Use Agents on Human Demonstrations

Add code
Nov 10, 2025
Figure 1 for Grounding Computer Use Agents on Human Demonstrations
Figure 2 for Grounding Computer Use Agents on Human Demonstrations
Figure 3 for Grounding Computer Use Agents on Human Demonstrations
Figure 4 for Grounding Computer Use Agents on Human Demonstrations
Viaarxiv icon

Paper2Video: Automatic Video Generation from Scientific Papers

Add code
Oct 06, 2025
Viaarxiv icon

Reinforcement Learning in Vision: A Survey

Add code
Aug 11, 2025
Viaarxiv icon

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Add code
May 27, 2025
Viaarxiv icon

Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

Add code
May 22, 2025
Viaarxiv icon

UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

Add code
Mar 19, 2025
Viaarxiv icon